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Abstract. Bourgain's discretization theorem asserts that there exists a universal constant 
C G (0,00) with the following property. Let X,Y be Banach spaces with dimX = n. Fix 
D £ (1,00) and set S = e~" . Assume that A/" is a (5-net in the unit ball of X and that 
Af admits a bi-Lipschitz embedding into Y with distortion at most D. Then the entire 
space X admits a bi-Lipschitz embedding into Y with distortion at most CD. This mostly 
expository article is devoted to a detailed presentation of a proof of Bourgain's theorem. 

We also obtain an improvement of Bourgain's theorem in the important case when Y = Lp 
for some p G [l,oo): in this case it suffices to take 6 ~ C^^rr'^l'^ for the same conclu- 
sion to hold true. The case p = 1 of this improved discretization result has the following 
consequence. For arbitrarily large n € N there exists a family of ri-point subsets of 
{1, . . . ,ri}^ C such that if we write = then any L\ embedding of equipped 
with the Earthmover metric (a.k.a. transportation cost metric or minimumum weight match- 
ing metric) incurs distortion at least a constant multiple of -^/log log A ; the previously best 
known lower bound for this problem was a constant multiple of \/log log log A. 

1. Introduction 

If (X, dx) and (Y, dy) are metric spaces then the (bi-Lipschitz) distortion of X in F, 
denoted cy(X), is the infimum over those -D G [1, 00] such that there exists f : X Y and 
s G (0, 00) satisfying sdxix,y) ^ dyifix), f{y)) ^ Dsdxix,y) for all x,y ^ X. Assume now 
that X, Y are Banach space, with unit balls Bx, By, respectively. Assume furthermore that 
X is finite dimensional. It then follows from general principles that for every e G (0, 1) there 
exists S G (0, 1) such that for every 5-net A^^ in Bx we have cy{Afs) ^ (1 — e)cy{X). Indeed, 
set D = cy{X) and assume that for all /c G N there is a l/fc-net Mi/k of Bx and fk : J^i/k Y 
satisfying ||a; - y\\x < \\fk{x) - fk{y)\\Y ^ (1 - e)D\\x - y\\x for all x,?/ G M/fc- For each 
X G Bx fix some Zk{x) G Mi/k satisfying ||x — ^ 1/^- Let be a free ultrafilter on 

N. Consider the ultrapower y^, i.e., the space of equivalence classes of bounded F- valued 
sequences modulo the equivalence relation {xk)'^=i ~ {yk)'kLi hnifc-s.'g/ \\xk — ykWr = 0, 

equipped with the norm \\{xk)'kLi/ ~ ||y^y = limfc-s>f/ ||a;fc||y. Define f^/ : Bx Ya// by 
U{x) = ifkizkix))^^,)/ ~. Then \\x - y\\x < \\hA^) - My)\\Y^ ^ (1 " e)D\\x - y\\x 
for all x, y G A. By a (nontrivial) u;*-Gateaux differentiability argument due to Heinrich 
and Mankiewicz [H] it now follows that there exists a linear mapping Ti : A — )■ {Y^)** 
satisfying ^ ||Tia;||(y<^).. ^ (1 — £:/2)D||a;||x for all x E X. Since A, and hence 

also TiA, is finite dimensional, the Principle of Local Reflexivity [19] says there exists a 
linear mapping T2 : TiA 1^ satisfying |||/||(y<^)" ^ ||7'2l/||y<^^ ^ (1 + ^/^)\\y\\{Y^,)" for 
all y G TiX. By general properties of ultrapowers (see [13]) there exists a linear mapping 
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Ta : T2T1X ^ Y satisfying \\y\\y^ ^ \\T^y\\Y ^ (1 + £/3)||y||y^ for all y e TsTiX. By 
considering T3T2T1 : X — )■ F we have D = cy(X) ^ (1 — e/2){l + e/3)'^D, a contradiction. 

The argument sketched above is due to Heinrich and Mankiewicz [13]. An earlier and 
different argument establishing the existence of S is due to important work of Ribe [22]. See 
the book [S] for a detailed exposition of both arguments. These proofs do not give a concrete 
estimate on 6. The first purpose of the present article, which is mainly expository, is to 
present in detail a different approach due to Bourgain which does yield an estimate on 6. 
Before stating Bourgain's theorem, it will be convenient to introduce the following quantity. 

Definition 1.1 (Discretization modulus). For e G (0, 1) let 6x^y{£) be the supremum over 
those 6 G (0, 1) such that every 6-net A/5 in Bx satisfies cy(A/5) ^ (1 — e)cy(X). 

Theorem 1.1 (Bourgain's discretization theorem). There exists C G (0, 00) such that for 
every two Banach spaces X, Y with dimX = n < 00 and dimY = 00, and every e G (0, 1), 
we have 

5x^y(£)^e-("/^)''". (1) 

Theorem 11.11 was proved by Bourgain in [7] for some fixed Eq G (0, 1). The above statement 
requires small technical modifications of Bourgain's argument, but these are minor and all the 
conceptual ideas presented in the proof of Theorem 11.11 below are due to Bourgain. Readers 
might notice that our presentation of the proof of Theorem 11.11 seems somewhat different 
from [7], but this impression is superficial; the exposition below is merely a restructuring of 
Bourgain's argument. 

We note that it is possible to refine the estimate ([1]) so as to depend on the distortion 
cy(X). Specifically, we have the bound 

W(^)^e-(-W/^)"". (2) 

The estimate ([2]) implies ([1]) since due to Dvoretzky's theorem [12] cy(^2) — ^1 ^"^^ therefore 
cy{X) ^ y/n by John's theorem [16]. If we do not assume that dimy = 00 then we 
necessarily have dim!" ^ n since otherwise cy(X) = 00, making ([2]) hold vacuously. Thus, 
by John's theorem once more, cy(X) ^ n, and again we see that ([2]) implies ([T]). The 
proof below will establish ([2]), and not only the slightly weaker statement ([T]). We remark 
that Bourgain's discretization theorem is often quoted with the conclusion that if 5 is at 
most as large as the right hand side of (|2]) and A^ is a (5-net of Bx then Y admits a linear 
embedding into Y whose distortion is at most cy(A/5)/(l — e). The Heinrich-Mankiewicz 
argument described above shows that for finite dimensional spaces X, a bound on cy(X) 
immediately implies the same bound when the bi-Lipschitz embedding is required to be 
linear. For this reason we ignore the distinction between linear and nonlinear bi-Lipschitz 
embeddings, noting also that for certain applications (e.g., in computer science), one does 
not need to know that embeddings are linear. 

We do not know how close is the estimate ([T]) to being asymptotically optimal, though we 
conjecture that it can be improved. The issue of finding examples showing that 5x^y{£) must 
be small has not been sufficiently investigated in the literature. The known upper bounds 
on (5xc->y(e) are very far from ([1]). For example, the metric space (£", \/\\x — y\\i) embeds 
isometrically into L2 (see [H]). It follows that any 5- net in B^n embeds into L2 with distortion 
at most a/2/(5. Contrasting this with ci^i^i) = y/n shows that 6e^^L2{^) ^ 2/ ((1 — e)'^n). 
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It turns out that a method that was introduced by Johnson, Maurey and Schechtman [T7] 
(for a different purpose) can be used to obtain improved bounds on Sx^vi^) for certain 
Banach spaces Y, including all Lp spaces, p G [1, oo); the second purpose of this article is to 
present this result. To state our result recall that if z/) is a measure space and (Z, || • \\x) 
is a Banach space then for p G [1, oo] the vector valued Lp space Lp(i^, Z) is the space of all 
equivalence classes of measurable functions f : Q ^ Z such that = Jq WfVz^^ < ^ 

(and \\f\\L^{u,z) = esssup^gf^||/(u;)||y). 

Theorem 1.2. There exists a universal constant n G (0, oo) with the following property. 
Assume that 5, e G (0, 1) and D G [1, oo) satisfy 6 ^ ne"^/ {n^D) . Let X, Y he Banach spaces 
with dimX = n < oo, and let Ms he a 5 -net in Bx- Assume that cyiMs) ^ D . Then there 
exists a separahle prohahility space (fi, z^), a finite dimensional linear suhspace Z (1 Y , and 
a linear operator T : X — )■ Loo(z^, Z) satisfying 

Va; G X, ^-^\\x\\x ^ \\Tx\\l^(^^^z) ^ \\Tx\\l^^^,z) ^ (1 +£:)||x||x- 

Theorem II .21 is proved in Sectional as we mentioned above, its proof builds heavily on ideas 
from [17]. Because z/ is a probability measure, for all p G [1, oo] and all h G L^^u, Y) we have 
||^||Li(!^,y) ^ ll^lUpCi/.y) ^ ||^IUoo(i^,y)- Therefore, the following statement is a consequence of 
Theorem 11.21 

2 

^ ^ VtaTT ^ G [1, oo), Cy{Ms) ^ (1 - e)cL,iu,Y){X). (3) 

We explained above that if Y is infinite dimensional then cy{Ms) ^ \/n. It therefore follows 
from ([3]) that if Lj,(z/, Y) admits an isometric embedding into F, as is the case when Y = Lp, 
then Sx^ri^) ^ ne^ / {n^^"^). This is recorded for future reference as the following corollary. 

Corollary 1.3. There exists a universal constant k, G (0, oo) such that for every p G [l,oo) 
and e G (0, 1), for every n-dimensional Banach space X we have 

Sx^lM > (4) 

There is a direct application of the case p = 1 of Corollary 11.31 to the minimum cost 
matching metric on M^. Given n G N, consider the following metric r on the set of all 
n-point subsets of M^, known as the minimum cost matching metric. 

t{A, B) = min < \\a — /(a)||2 : f : A ^ B is a bijection 

Corollary 1.4. There exists a universal constant c G (0, oo) with the following property. 
For arhitrarily large n G N there exists a family ^ of n-point suhsets of {1, . . . ,n}^ C 
such that if we write \3/'\ = N then ci($^,r) ^ c^log log N. 

The previously best known lower bound in the context of Corollary 11.41 due to [20], was 
ci{W,t) ^ cyTogTogToglV. We refer to [20] for an explanation of the relevance of such 
problems to theoretical computer science. The deduction of Corollary 11.41 from Corollary 11.31 
follows mutatis mutandis from the argument in [SDl Sec. 3.1], the only difference being the 
use of the estimate (jlj) when p = 1 rather than the estimate ([I]) when Y = Li. 
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For an infinite dimensional Banach space Y define 

6n{Y) =^ inf {6x^Y (V^) • ^ is an n dimensional Banach space} , 

and set 

6n = ini{6n{Y) : F is an infinite dimensional Banach space} . 

Theorem 11.11 raises natural geometric questions. Specifically, what is the asymptotic behavior 
of 5„ as n — )■ oo? The difficulty of this question does not necessarily arise from the need 
to consider all, potentially "exotic", Banach spaces Y. In fact, the above discussion shows 
that fi(l/n5/2) ^ 6n{L2) ^ 0(l/n), so we ask explicitly what is the asymptotic behavior 
of Sn{L2) as n — J- oo? For applications to computer science (see e.g. [20]) it is especially 
important to bound so we also single out the problem of evaluating the asymptotic 

behavior of Sn{Li) as n ^ oo. Recently, two alternative proofs of Theorem 11.11 that work for 
certain special classes of spaces Y were obtained in [T8l 115] , using different techniques than 
those presented here (one based on a quantitative differentiation theorem, and the other on 
vector- valued Littlewood-Paley theory). These new proofs yield, however, the same bound 
as ([1]). The proof of Theorem 11.11 presented below is the only known proof of Theorem 11.11 
that works in full generality. 

Remark 1.2. The questions presented above are part of a more general discretization prob- 
lem in embedding theory. One often needs to prove nonembeddability results for finite spaces, 
where the distortion is related to their cardinality. In many cases it is, however, easier to 
prove nonembeddability results for infinite spaces, using techniques that are available for 
continuous objects. It is natural to then prove a discretization theorem, i.e., a statement 
that transfers a nonembeddability theorem from a continuous object to its finite nets, with 
control on their cardinality. This general scheme was used several times in the literature, 
especially in connection to applications of embedding theory to computer science; see for 
example [20], where Bourgain's discretization theorem plays an explicit role, and also, in 
a different context, [H]. The latter example deals with the Heisenberg group rather than 
Banach spaces, the discretization in question being of an infinitary nonembeddability theo- 
rem of Cheeger and Kleiner [8]. It would be of interest to study the analogue of Bourgain's 
discretization theorem in the context of Carnot groups. This can be viewed as asking for a 
quantitative version of a classical theorem of Pansu [21]. In the special case of embeddings 
of the Heisenberg group into Hilbert space, a different approach was used in [2] to obtain a 
sharp result of this type. 

Remark 1.3. A Banach space Z is said to be finitely representable in a Banach space Y if 
there exists K G [1, oo) such that for every finite dimensional subspace X C Z there exists an 
injective linear operator T : X — >■ F satisfying ||T||-||T~^|| ^ K. A theorem of Ribe [22] states 
that if Z and Y are uniformly homeomorphic, i.e., there exists a homeomorphism f : Z ^ Y 
such that both / and are uniformly continuous, then Z is finitely representable in Y 
and vice versa. This rigidity phenomenon suggests that isomorphic invariants of Banach 
spaces which are defined using statements about finitely many vectors are preserved under 
uniform homemorphisms, and as such one might hope to reformulate them in a way that is 
explicitly nonlinear, i.e., while only making use of the metric structure and without making 
any reference to the linear structure. Once this (usually nontrivial) task is achieved, one 
can hope to transfer some of the linear theory of Banach spaces to the context of general 
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metric spaces. This so called "Ribe program" was put forth by Bourgain in [6]; a research 
program that attracted the work of many mathematicians in the past 25 years, and has had 
far reaching consequences in areas such as metric geometry, theoretical computer science, and 
group theory. The argument that we presented for the positivity of (5x^y(e) implies Ribe's 
rigidity theorem. Indeed, it is a classical observation [TU] that if / : Z — t- F is a uniform 
homeomorphism then it is bi-Lipschitz for large distances, i.e., for every d G (0, oo) there 
exists L G (0, oo) such that L~^||x — y\\z ^ 11/(3:^) — /(l/)||y ^ L\\x — y\\z whenever x,y E Z 
satisfy ||a; — y\\z ^ d- Consequently, if X C Z is a finite dimensional subspace then d-nets 
in rBx embed into Y with distortion at most for every r > d. By rescaling, the same 
assertion holds for (5-nets in Bx for every 6 G (0, 1). Hence X admits a linear embedding 
into Y with distortion is at most 2L^. For this reason, in [7] Bourgain calls his discretization 
theorem a quantitative version of Ribe's finite represent ability theorem. Sufficiently good 
improved lower bounds on 6x^y{£) are expected to have impact on the Ribe program. 

2. The strategy of the proof of Theorem 11.11 

From now on (X, || ■ \\x) will be a fixed n-dimensional normed space {n > 1), with unit 
ball Bx = {x E X : \\x\\x ^ 1} and unit sphere Sx = {x & X : \\x\\x = !}• We will 
identify X with M", and by John's theorem we will assume without loss of generality 
that the standard Euclidean norm || ■ II2 on M" satisfies 

V X G X, ^^llxlU ^ \\x\\x ^ ll^^lh- (5) 
Jn 



Fix e,S E (0, 1/8) and let Afs be a fixed 5-net in Bx- We also fix Z) G (1, 00), a Banach 
space (Y, \\ ■ ||y), and a mapping / : A/5 — )■ Y satisfying 

Wx,ye Afs, ^\\x - y\\x ^ ||/(x) - /(y)||y ^ \\x - y\\x. (6) 

By translating /, we assume without loss of generality that f{N's) C 2By- Our goal will 
be to show that provided 5 is small enough, namely 5 ^ e"^^/*^-* there exists an injective 
linear operator T : X — )■ F satisfying ||T|| • ||T~^|| ^ (1 + 12e)D. 

The first step is to construct a mapping F : — )■ F that is a Lipschitz almost-extension 
of /, i.e., it is Lipschitz and on A/5 it takes values that are close to the corresponding values of 
/. The statement below is a refinement of a result of Bourgain [7]. The proof of Bourgain's 
almost extension theorem has been significantly simplified by Begun [1], and our proof of 
Lemma [2.11 below follows Begun's argument; see Section Ei 

Lemma 2.1. If 6 < ^ then there exists a mapping F : M" — )■ F that is differentiate almost 
everywhere on M", is differentiable everywhere on \Bx, and has the following properties. 

• F is supported on 3Bx- 

• \\F{x) -F{y)\\Y ^ 6\\x-y\\x for all x,y G M". 

• ||F(x) -F(y)||y ^ {l + s)\\x-y\\x for allx,y \Bx. 

• \\F{x) - /(x)||y ^ ^ for all x G Ms. 

In what follows, the volume of a Lebesgue measurable set ACM" will be denoted vol(A). 
For t G (0, 00) the Poisson kernel : — )■ [0, 00) is given by 

Cfit 



Pfix) 

{t^ + \\x\\l) 
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n + 1 
TT 2 , 



where c„ is the normahzation factor ensuring that Pt{x)dx = 1. Thus c„ = T (^) / 
as computed for example in [231 Sec. X.3]. We will use repeatedly the standard semigroup 
property Pt*Ps = Pt+s, where as usual / * g{x) = f{y)g{y - x)dx for f,g e Li(M"). 

Assume from now on that 5 < and fix a mapping F : M" — F satisfying the conclusion 
of Lemma |2.1[ We will consider the evolutes of F under the Poisson semigroup, i.e., the 
functions Pt*F : M" — F given by Pt*F{x) = J^^i Pt{y — x)F{y)dy. Our goal is to show that 
there exists to G (0, oo) and x e such that the derivative T = [PtQ*Fy{x) is injective and 
satisfies ||r|| ■ \\T-'^\\ ^ (1 + We)D. Intuitively, one might expect this to happen for every 
small enough t, since in this case Pt * F is close to F, and F itself is close to a bi-Lipschitz 
map when restricted to the 6-net Ms- In reality, proving the existence of to requires work; 
the existence of to will be proved by contradiction, i.e., we will show that it cannot not exist, 
without pinpointing a concrete to for which (Pj^ * F)'{x) has the desired properties. 

Lemma 2.2. Let fi be a Borel probability measure on Sx- Fix R,A& (0, oo) and m G N. 
Then there exists t G (0, oo) satisfying 

^ ^ t ^ A, (7) 



{R + l)™+i 
such that 

6vol(3Px 



daiPt*F)ix)\\Ydxdfi{a) ^ [ [ \\daiPiR+i)t*F)ix)\\Ydxdfi{a) + 



m 



Proof. If ([8]) fails for all t satisfying ([7]) then for every fcG{0,...,m + l}we have 



JSx -'K" 



x) dxdn{a) 



'Sx 

By iterating (|9]) we get the estimate 



> / / \\da{PA{R+iY-m*F){x)\\ydxdii{a)A —. (9) 

JSx J^" 



m 



I I \\da (^'A(i?+i)— 1 * F) {x)\\Ydxdii{a) 

J Sx J^" 

>[ [ pa (Pa(«.1) * F) (X)||, dxdM + ^("^ + l)^"K3i?x) ^ (^Q^ 
JSx "''K" ""^ 

At the same time, since F is different iable almost everywhere and 6-Lipschitz, for every 
a G we have ||(9aF||y ^ 6 almost everywhere. Since F is supported on 3Bx, it follows 
that 

/ pa *F){x)\\ydx = II (P^(^+i)-™-i * daF) {x)P dx 

/ / PAiR-,i)-r^-^ix-y)\\daF{y)\\ydxdy= [ \\daF{y)\\Ydy^Qvo\i3Bx). (11) 



If we integrate ( fTTl) with respect to /i, then since /i is a probability measure we obtain a 
contradiction to ffTOl) □ 
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In order to apply Lemma [2l2| we will contrast it with the following key statement (proved 
in Section Hj), which asserts that the directional derivatives of Pt * F are large after an 
appropriate averaging. 

Lemma 2.3. Assume that t E (0,1/2], Re (0, oo) and S E (0,£:/(4n)) satisfy 

£tlog(3/t) ^ 

and 

7^ ^^^32V^- ^^^^ 

Then for every x E ^Bx and a E Sx we have 

(\\d,(P,*F)\\y*Pnt)ix)^^. (14) 

We record one more (simpler) fact about the evolutes of F under the Poisson semigroup. 

Lemma 2.4. Assume that < t < 25^- Then for every x,y E \Bx we have 

\\Pt * F{x) - Pt * F{y)\\Y ^ (1 + 2e)\\x - y\\x. 

With the above tools at hand, we will now show how to conclude the proof of Theorem ll.il 
It will then remain to prove Lemma [2TT] (in Section[3]), Lemma [2l3] (SectionHI) and Lemma [2^ 
(also in Section Hj). 

Proof of Theorem \l.l[ Assume that 6 E (0, 1) satisfies 

. (15) 

where c = 300 (this is an overestimate for the ensuing calculation). Fix an {e/D)-net T in 
Sx with \T\ ^ (3-D/e)" (for the existence of nets of this size, see e.g. [H Lem. 12.3.1]). Let 
/i be the uniform probability measure on T . Define 



n+1 

e ) 



cD\ 



1. (16) 



Apply Lemma [2.21 with the above parameters, obtaining some t E (0, 00) satisfying 



such that 



V/ \mPt*F){x)\\ydx^y2 [ ||c?a(P(i^+l)t * F){x)\\ydx + ^ l-^I^^^Si^x) 



One checks that for 6 satisfying f|T5|) . R as in f|T6|) . and any t satisfying f|T71) . inequalities f|T2l) 
and f|T3l) are satisfied. Thus the conclusion f|T^ of Lemma 12.31 holds true for all a E Sx and 
X E \Bx. 

Note that by convexity we have for every a E Sx and almost every x E M*^, 

\\da{PiR+i)t * F){x)\\y = II {Pm. * {da{Pt * F))) {x)\\y ^ {\\da{Pt * F)\\y * Pm) [x). 
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Thus ||5a(Pt * F)\\y * Pm — \\da{P{R+i)t * F)\\y ^ 0, so we may use Markov's inequality as 
follows. 



vol {x e ^-Bx : {\mPt*F)\\Y*Pm) {x) - ||9„(P(r+i), * F){x)\\y ^ ^ 

^ -([ {i\\da{Pt*F)\\y*Pnt){x)-\\da{P{R+l)t*F){x)\\Y)dx 

\\da{Pt * F){x)\\Ydx - [ \\da{PiR+l)t * F)(x)||ydx) . (19) 



D 

e 

Hence, 



vol e ^Bx : 3a eJ^, (||a,(Pi * F)||y * Pm) (x) - ||9,(P(R+i)t * F){x)\\y ^ | 
^ -[y, \\da{Pt*F){x)\\Ydx-y2 \mPiR+i)t*F){x)\\Ydx 



^ D 6|J^|vol(3Px; 



e m 



aV" ./I 



Consequently, there exists x G |-Bx satisfying 

Va G ^, {\mPt * F)\\y * PRt) (x) - \mPiR+i)t * F){x)\\y < ^. (20) 

But we already argued that ( |T4l) holds as well, so ( l20l) implies that 

VaG^, ||9,(P(«+i)i*F)(x)||y^i^. (21) 

Note that by dH]) and ([IT]) we have (P + l)t ^ (^/(cP'))" < e/ (25v^). Hence, if we define 
T = (P(^+i)(*P)'(x) then by LemmaEaiwe have ||T|| ^ 1+2^. By (EID, ||Ta||y ^ (l-2e)/P 
for all a G J-". For z G take a G J-" such that ||z — a||x ^ e/D. Then, 

||T.|| ^ llTall - ||T|| • Ik - a|U ^ - (1 + 2e)^ ^ 

HenceTisinvertibleand ||T-i|| ^P/(l-4e). Thus ||T|| ■ ||T-i|| ^ i±||P ^ (1 + 12£)P. □ 

3. Proof of Lemma EH] 

We will use the following lemma of Begun [1] . 

Lemma 3.1. Let K (1 be a convex set and fix T,r],L G (0,oo). Assume that we are 
given a mapping h : K + tBx -> Y satisfying \\h{x) — h{y)\\Y ^ L {\\x — y\\x + v) /^"^ o.^^ 
x,y e K + tBx- Define H : K ^ Y by 



r"vol(Px) JrB^ 
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Then \\H{x)-H{y)\\Y ^ L (l + ^) \\x - y\\x for allx,yeK. 

We refer to |1] for an elegant proof of Lemma 13.11 The deduction of Lemma 12.11 from 
Lemma Ism is via the following simple partition of unity argument. Let {0p : M" — [0, l]}pg_v'« 
be a family of smooth functions satisfying J^peMs ^p(^) ~ ^ ^ ^ and (j)p{x) = 

for all {p,x) G A/5 x with ||x — ^ 25. A standard construction of such functions 
can be obtained by taking a smooth : — )■ [0, 1] which is equals 1 on Bx and vanishes 
outside 2Bx, and defining ippi^) = — p)/S) for {p,x) G Afs x M". If we then write 
A/^ = {pi,P2, • • • ,Pn}, define = Vpi and = i>p. UlZli^ - V^pJ for 3 ^ {2, • • • , A^}- 
Then '^p^j^^ (pp = 1 — YlpeAf^i^ ~ ^p) = 1 on -Bx since every x G i?x satisfies ||x — p\\x ^ 5 
for some p E Ms. 

Now define (7 : Bx Y hj g{x) = J2pej\fs 'Pp{^)f{p)- Setting f3{t) = max{0, 2 — t} for 
t G [0, 00), consider the mapping h : R" — )■ Y given by 



h{x) = 

Observe that if x, ?/ G Bx then 
- h{y) = g{x) - g{y) = 



g{x) if X G Bx, 

\x\\x)g{x/\\x\\x) ifxGM"\Sx. 



(22) 



peJVsn{x+2SBx) qeAfsn{y+25Bx) 

J2 M^)My) Ifip) - fm ■ 

pGAfsn{x+25Bx) 
q(iAfsniy+25Bx) 

This identity implies that 

yx,yeBx, \\h{x)-h{y)\\y ^\\x-y\\x + ^5 (23) 
If X G Bx and y G M" \ i?x then using /(A/^) C 2i?y and the fact that /3 is 1-Lipschitz, 



||Mx)-%)||y ^ 

y 



9[x) -g 



y 



\y\\x 



y\\x)) 



Y 
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y 



\y\\x 



Y 



X 



\y\\x 



+ A6 + i\\y\\x - 1) sup \\fip)\\y ^ \\x - y\\x + 3(||y||x - 1) 



Since \\y\\x — 1 ^ — y\\x + ll^^llx — 1 ^ — y\\x, it follows that 

Vx G G M" \ Bx, \\h{x) - h{y)\\y ^ 4(||x - y\\x + 5). 

If X, y G W \ fix then 



\\h{x)-h{y)\\y ^ 



9 



X 



m\x 

l|23)l 



X 



IZ/||x 

y 



x\\) + 



9 



y 



\y\\x 



X 



45. 



(24) 



y\\x)\ 



\^\\x 



\y\\x 



+ 45 + 2 ||x - y\\^ ^ 4(||x - y\\x + 6). (25) 



X 



Set r = 2n5/e G (0, 1/2) and define for x G M" 

F(x) ^ 



r"V0l(fix) JrBx 



h{x - y)rf?/. 



(26) 
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It follows from the definition (122|) that h is different iable almost everywhere on M"; in fact, 
it can only be non-differentiable on Sx U {2Sx)- Since h is differentiable on Bx \ 5"^ and 
r G (0,1/2), it follows from (!26|) that F is differentiable almost everywhere on M", and is 
differentiable everywhere on ^Bx- Clearly F is supported on {2 + t)Bx C 3Bx, i.e., the first 
assertion of Lemma [2.11 holds. Due to fl23l) . flM|) . fl25|) . an application of Lemma [3.11 with 
A' = R'^, L = 4 and r] = 6 shows that F is 4 (1 + e/2)-Lipschitz on M", proving the second 
assertion of Lemma 12.1 [ Due to fl23|) . an application of Lemma 13.1 1 with K = [1 — t)Bx 
shows that F is (1 + e)-Lipschitz on (1 — r)Bx ^ \Bx- This establishes the third assertion 
of Lemma Em To prove the fourth assertion of Lemma [2.11 fix x G Ms. Then, 

|[23tA|[24ll 



Also, 



\\F{x)-h{x)\\y^-—^^j^ \\h{x - y) - h{x)\\y dy ^ 4(r + 5). (27) 



\\h{x) - f{x)\\y ^ Yl - f(P)\\yM^) ^ max i|/(x) - fip)\\y ^ 26. (28) 

Recalling that r = 2n5/e, the fourth assertion on Lemma [2TT] follows from ([27j) and (128!) . □ 

4. Proof of Lemma 12.41 and Lemma 12.31 
We will need the following standard estimate, which holds for all r, t G (0, oo). 



Pt{x)dx ^ (29) 

To check f[29|) . letting s„_i denote the surface area of the unit Euclidean sphere 5*""^, and 
recalling that Pt{x) = t~^Pi{x/t), we have 



m f r 

Pt{x)dx ^ / Pt{x)dx = / Pi{x)dx 



I ~ „, n+i '^'^ ^ 



r/t (1 + S^) 2 Jr/t 



S 



2 



It remains to recall that Cn = T (^^) /vr 2 and s„_i = n7r2/r (| + l) (see e.g. [3l Sec. 1]), 
and, using Stirling's formula, to obtain the estimate c„s„_i ^ y^2n/TT. 

Another standard estimate that we will use is that for every y G we have 

/ \P,{x)-Ptix + y)\dx^^^^^. (30) 

Since Pt{x) = t~"Pi(x/t) it suffices to check (!30!) when t = 1. Now, 

f \Pi{x) - Pi{x + y)\ dx = I I {VPi{x + sy),y)ds dx ^ \\y\\2 [ || VPi(x)||2 cix 
Jr" Jr" Jo Jr"- 

= (n + l)cn / ^dx = {n + l)c„s„_i / TT^dr = c„s„_i, 

JR" (1 + \\x\\l) 2 Jo (1 + H) 2 

where we used the fact that the derivative of r^~^^ /{I + r^)^ equals (n + l)r"/ (1 + r^)~2^. 
The required estimate ([301) now follows from Stirling's formula. 
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Proof of Lemma \2.4\ We have, 
\\Pt*F{x)-Pt*F{y)\\Y ^ I Pt{z)\\F{x-z)-F{y-z)\\ydz 

((1+^) / Pt{z)dz + Q [ Pt{z)dz] \\x-y\\x 

^ {l + e + 24:t^/n)\\x-y\\x, 
where in (*) we used the fact that F is (1 + £:)-Lipschitz on |-Bx and 6-Lipschitz on M". □ 
Lemma 4.1. For every t G (0, 1/2] and every x G Bx we have 

'3~ 



\\Pt * F{x) - F{x)\\y ^ Sv^tlog 

Proof. Since F is supported on ?>Bx-i 

\\Pt*F{x)-F{x)\\y 

^[ \\F{y-x)-F{x)\\yPt{y)dy+\\F{x)\\y [ Pt{y)dy. (31) 

Since F is 6-Lipschitz and it vanishes outside 3-Bx, we have ||F(x)||y ^ 18. Moreover, if 
\\y — x\\x ^ 3 then \\y\\x ^ \\x — y\\x — \\A\x ^ 2, and therefore 



F{x)\\y / Pt{y)dy^l^ / Pi(y)rf2/ ^ 9tv^. (32) 



ll29t 

Pt{y)dy 

To bound the first term in the right hand side of (!3T]) note that if — ^ 3 then 
lbl|2 ^ ynllyllx ^ 4A/n. Moreover, \\F{y - x) - F{x)\\x ^ &\\y\\x ^ 6||y||2. Hence, 



I \\F{y - x) - F{x)\\yPt{y)dy f \\y\\2Pt{y)dy 



'x+3Bx J Wvh^'iy/n 

= 6t/ ^\\y\\2Pi{y)dy = tCnSn-i z-^^+rd'S (33) 

J\\y\\2^±^ Jo (1+S2) — 

Direct differentiation shows that the maximum of s"'/(l + s'^)~^ is attained when s = a/tz, 
and therefore 5^^/(1 + s^)^ ^ mm{l/y/en, 1/s} for all s G (0, oo). Hence, 

—ds ^ 1 + / - = 1 + log . (34) 



o\ "-+1 ^ / \ ( / i • 

l + s2— iv^s ^ We/ 



The required result now follows from substituting ( l32l) . (!33l) . (!34|) into (!3T1) . and using the 
fact that t ^ 1/2 and c„s.„_i ^ y/n. □ 



Proof of Lemma \2.3l Define 



Q ^ 100Dv/^tlog(3/t) ^ ^g^^ 
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For w,y E \Bx let p,q E Ms satisfy \\p — w\\x, Wl — vWy ^ ^- Assume that \\w — y\\x ^ ©• 
Using the third and fourth assertions of Lemma I2.H together with Lemma I4.H we have 

\m * F){w) - (P, * F){y)\\y ^ \\f{p) - f{q)\\y - \\F{p) - f{p)\\y - \\F{q) - f{q)\\y 

-\\F{w) - F{p)\\y - \\F{y) - F{q)\\y - ||(P, * F){w) - F{w)\\y - ||(P, * F){y) - F{y)\\y 



® \\p-q\\y 18n5 _ ^ , /3 

^ 11^ ^\\y _ 2(1 16v^tlog f - 

^ 2(1 + ejd - levntlog I - 



l-£/3„ 

^ — — \\'^-y\\x, 



(36) 

where ( 136|) uses the assumptions — ^ © and f|T2|) . 

Note that the second inequahty in f|T2l) imphes that ^ 1/4. Therefore, since ||a||jic = 1 
it follows from f p6|l that for every z G jBx, 



l-e/3 
D 



^ \\Pt * F{z + Oa) - Pt * F{z)\\y 







da{Pt* F){z + sa)ds 



y JO 







da{Pt*F){z + sa)\\yds. (37) 



Since in the statement of Lemma [2.31 we are assuming that ^ 1/8 

1 .e 







\da{Pt*F){x + sa- 
m l-e/3 



D 



iE3 1 - 5/3 [ 
yPRt{y)dyds ^ — - — / PRt{y)dy 

il-8RtVE)%^^'-^/'^i'-^/'h'-^. (38) 



D 



D 



Since F is 6-Lipschitz, ||(9aP||y ^ 6 almost everywhere, and therefore ||9a(Pj * P)||y ^ 6 
almost everywhere. Hence, 



r [ \\da{Pt*F){x-y)\\y{PRt{y + sa)-Pnt{y))dyds 
Jo Jr" 



Rt 



^ 6 1 \Pnly + so) - Pm{y)\dyds ^ ■ — ^ ^ — • (39) 

We can now conclude the proof of Lemma 12.31 as follows. 



\da{Pt*F)\\y*PRt){x) = ^ f [ \\da{Pt*F){x + sa-y)\\yPRt{y)dyds 

^ Jo Jm." 



1 r 



ilMliAiisgi I — £ 

\daiPt*F){x-y)\\yiPRtiy + sa)-PRtiy))dyds ^ □ 



5. Proof of Theorem 11.21 
The following general lemma will be used later; compare to [T71 Prop. 1]. 
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Lemma 5.1. Let {V^ || ■ ||v) he a Banach space and U = (M", || ■ \\u) be an n- dimensional 
Banach space. Assume that g : Bu V is continuous and everywhere differentiable on the 
interior of Bu. Then 

1 



yo\{Bi 



g'(u)du 



(40) 



Proof. We may assume without loss of generality that vol(i?[/) = 1. Fix y G with 
\\y\\2 = 1- For every u E y^ H {Bjj \ Su) there are unique a„, fe„ G M satisfying < 6„ and 
\\u + auy\\u = \\u + buy\\u = 1- Hence, 









/ 9\u){y)du 




I t 




V 


Jy^nBu Jau 



d 

ds 



g{u + sy)dsdu 



{g{u + buy) - g{u + Ouy)) du 



'y^nBu 



Loo{Su) 



yo\n^i{y^nBu). (41) 



Let K be the convex hull of (y-*- H Bu^ U Then K is a. subset of the intersection of 

Bu with one of the two half spaces corresponding to the hyperplane y^. Since vol(i?;7) = 1, 

voln-iiy^ n Bu) 



f^\\y\\u 



vo\{K) ^ 



(42) 



The desired estimate (HOjl follows from substituting the upper bound on vol„_i(y-'- fl Bu) 
that follows from (USD into (gB. □ 



Fix e, (5 G (0, 1/2) and let A/5 be a 6-net in Bx ^ M". Fixing also D G (1, 00), assume that 
/ : A/5 — )■ Y satisfies ||x — y\\x/D ^ \\f{x) — f{y)\\Y ^ H^; — y\\x for all x,y E Ms- Define 
Z = span (/(A/^)). Thus Z is a finite dimensional subspace of Y. Assume that 

^2 



5 ^ 



(43) 



Since consequently 5 < e/{4:n), there exists F : X Z that is differentiable everywhere and 
satisfies the conclusion of Lemma 12.11 Let u be the normalized Lebesgue measure on |-Bx 
and define a linear operator T : X — > Lao{v: Z) by 



{Ty){x) = F'{x){y). 
Since F is (1 + £:)-Lipschitz on \Bx we have the operator norm bound 

\\T\\x^Loo{v,Z) ^ 1 + 

Theorem 11.21 will therefore be proven once we show that for all y G X we have 



(44) 



D 



\y\\x ^ \\Ty\\L,[v,z) 



\F\x){y)\\Ydx. 



(45) 



vol {\Bx) JlBx 

To prove ( H5l) . let J : X — )■ £00 be a linear isometric embedding. By the nonlinear Hahn- 
Banach theorem (see e.g. [5l Ch. 1]) there exists a mapping G : Z ^ ioo satisfying 

Vx G ATs, G{f{x)) = J{x) (46) 

and G is _D-Lipschitz; we are extending here the mapping Jo {f~^\f(j\fs)) ■ /(A/^) — ?■ £00 while 
preserving its Lipschitz constant. By convolving G with a smooth bump function whose 
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integral on Y equals 1 and whose support has a small diameter, we can find H : Z ^ 
with Lipschitz constant at most D and satisfying 



WzeF{Bx), \\H{z)-G{z)\U^ <: 



nD5 



Define a linear operator S : Liiy, Z) — £00 by setting for h G Li{v, Z), 



Sh 



H'{F{x)){h{x))dv{x). 



(47) 



(48) 



Since H is Z^-Lipschitz and i/ is a probability measure, we have the operator norm bound 

\\S\Wiu,z)^i^^D. (49) 
By the chain rule, for every y & X we have 

ST{y) ®=® / H\F{x)){F'{x){y))diy{x) = I {Ho F)' {x){y)dv{x). (50) 

Note that if y G A/5 then 

\\H{F{y)) - Jy\U^ 



diet 



\\H{F{y))-G{fmU^ 
^ \\H{F{y)) - G{F{y))\\,^ + ||G(F(y)) - G(/(y))||,, 



63 nD5 
< 



D\\F{y)-f{y)\\ 



Y 



e 

nD6 ^ 9n6 10nD6 
^ + D ^ , 



e 



e 



(51) 



where in the penultimate inequality in fl5T]) we used the fact that ||-F(2/) — f{y)\\Y ^ 9n6/e 
for all y G A/5, due to Lemma [2711 If x G \Bx then there exists y G JVs fl (jBx) satisfying 
ll^; — y\\x ^ 2(5. Using the fact that iJ o F is (1 + £:)D-Lipschitz on it follows that 

\\H{F{x)) - Jx\U^ ^ WHiFiy)) - Jy\U^ + ||i/(F(x)) - HiF{y))\U^ + \\Jx - Jy\U^ 

m 10nD6 ,^ ^^ 15nD5 
^ + 1+e £'■25 + 25^ . 52 

e e 

3o, II ■ 11(7 = 2|| ■ ||x and g = H o F — J, it follows from flS2]) that 



By Lemma [5. II with V 
\\ST — J||x^ 



[if oF)'(x)di^(x) - J 



30n^D6 61 

e 



(53) 



It follows that for all ?/ G X, 

l [49ll | [53t 

I?||T|/|U,(.,z) ^ ||5Ty||,^^||Jy||,^-||5T-J|U^,^-||y|U ^ (l-e)lbllx. 
This concludes the proof of ( l45l) . and hence the proof of Theorem 11.21 is complete. 



□ 
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